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Introduction 

This study focuses on enhancing the validity of foreign 
language (L2) reading comprehension assessment measures. In 
recent years, increased discussion has ensued regarding the 
validity of L2 reading tests. One facet of the assessment debate 
centers on the decrease in construct validity that results when 
readers are required to write their responses on L2 reading 
measures in the L2 . A number of studies have demonstrated that 
such tests reveal less evidence of comprehension than do those 
that call for responses in the native language (LI) (Hock & Poh, 
1979; Lee, 1986; Shohamy, 1984). According to Weber (1991), when 
L2 readers respond "in their weaker second language, they may be 
limited in their ability to demonstrate their level of 
understanding" (p. i09) . Thus, the practice of requiring 
responses in the L2 results in a confounding of the productive 
skill of writing and the receptive skill of reading. 

Although L2 reading tests calling for Ll responses yield 
scores that more accurately represent reading comprehension 
(Bernhardt, 1991; Hock & Poh, 1979; Lee, 1986; Shohamy, i984; 
Weber, 1991) , the reception-production dilemma may still influence 
test scores. m this case, the fusion of Ll writing and L2 
reading could result in scores representing a blend of both 
factors, thus decreasing test validity. 



The purpose of the present study was to enhance L2 reading 
test validity by measuring the unique contributions of selected LI 
and L2 literacy variables- -LI writing ability, LI reading 
comprehension, L2 vocabulary knowledge, and L2 gran^atical skill- 
to the scores on L2 reading comprehension tests. Additionally, it 
has investigated whether or not these contributions differ by 
course level . 

Theoretical and Empirical Support 
Test T hen-ry 

Test validity "is the most important consideration in test 
evaluation," according to a joint commission of the M,erican 
Psychological Association, the American Educational Research 
Association, and the National Council of Mathematics Educator, (in 
Bernhardt, 1991, p. 192). According to Messick (1988), the 
unifying force of test validity is "empirically grounded construct 

interpretation" (n 35) mh,,n 

IP- JbK Thus, because construct validity serves 

as the cornerstone of test validity, it is impossible for test 
validity to exist without "construct-related evidence" (Messick, 
1988- p. 35) . in order for a test to provide construct validity, 
it must "match as closely as possible what is known about a 
process at any point in time" (Bernhardt, 1991, p. 192). 

An instrument that claims to measure the construct of L2 
reading comprehension, therefore, must tap it in a manner that 
accurately reflects current theory- and research-based knowledge 
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of the L2 reading comprehension process. The accuracy of the 
interpretations and the appropriateness of the decisions that are 
based on the results of L2 reading comprehension measures hinge on 
the degree of construct validity that these tests provide. 

The MeldincT of Literacy Transfer Theory 
and the Reading-Writing Relationship 
Cummins 's (1981) Interdependence Hypothesis works in 
conjunction with reading- writing relationship theory to buttress 
the proposition that LI writing ability constitutes a portion of 
the scores on L2 reading assessment measures. The Interdependence 
Hypothesis asserts that literacy skills can transfer between 
languages, and maintains that experience with either the LI or the 
L2 promotes development of the capacities underlying both 
languages . 

An abundance of research on language transfer supports 
Cummins'B (1981) claims. The studies of Groebel (1980), Carson, 
Carrell, Silberstein, Kroll, and Keuhn (1990), Goldman, Reyes, and 
Varnhagen (1984), Reyes (1987), and Wagner, Spratt, and Ezzaki 
(1989), among many others, indicate that reading skill transfers 
between languages, while the research of Canale, Frenette, and 
Belanger (1988), Edelsky (1982), Jones and Tetroe (1987), Lay 
(1982), Mohan and Lo (1985), and Arndt (1987) point to the 
phenomenon of writing ability transfer. 
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The notion of a reading- writing relationship, like that of 
language transfer, has long been sustained by theory and research. 
Wittrock (1984) and Squire (1984) view reading skill and writing 
ability as employing the same cognitive processes. According to 
Wittrock (1984) , both reading and writing involve "generative 
cognitive processes" (p. 600) that allow the learner to construct 
meaning by relating the text to personal experience. Squire 
(1984) evinces agreement with Wittrock by stating that composing 
and compreb*5nding are "two sides of the .iame basic process" (p. 
581) of constructing and reconstructing ideas, Tierney and 
Pearson (1983) further develop the notion of shared cognitive 
processes in reading and writing through their composing model of 
reading, which asserts that both reading and writing are 
essentially composing processes (in Tierney & Shanahan, 1991) . 

In the research arena, numerous studies clearly point to a 
relationship between reading and writing skills, indicating the 
exis'^ence of overlapping cognitive processes. The studies of 
Carson et al . (1990), Juel, Griffith, and Gough (1986), Langer 
(1984), Schewe and Froese (1986), Shanahan (1984), and Shanahan 
and Lomax (1986, 1988) , among many others, suggest the likelihood 
of shared knowledge in reading and writing. 

In summary, theoretical and empirical evidence point to the 
occurrence of language transfer and to the existence of a 
relationship between reading skill and writing ability. Evidence 

ERIC 



of a reading-writing relationship, coupled with that of the 
transfer of skills between languages, leads to the hypothesis that 
LI literacy abilities contribute to the scores on L2 reading 
tests . 

Th e L2 Reading Comprehension Process 
Bernhardt' s (1986) Constructivist Reading Model contributes a 
valuable theoretical foundation to this study because it takes 
into account the roles in the L2 reading comprehension process of 
two of the literacy variables that were explored: L2 vocabulary 
knowledge and L2 grammatical skill. This six-part model 
integrates both text-based and extratext- based components. The 
text-based elements include phonemic/graphemic features, syntactic 
feature recognition (grammatical ability) , and word racognitibn 
(vocabulary knowledge), while the extratext -based features 
comprise intratextual perceptions, prior knowledge, and 
metacognition. Because the model is non- linear, the six elements 
are free to interact and to meld together in varying combinations 
in order to forge the construct of L2 reading comprehension 
(Bernhardt, 1986) . 

This model is particularly relevant to the present study 
because the unique contributions of its elements to L2 reading 
comprehension remain unspecified. Such flexibility is crucial 
because the contributions may differ by course level, or by a host 
of other variables (Everson, 1986) . The present study has aided 
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in the testing of this model because it has explored the extent to 
which word recognition and syntactic feature recognition 
contribute to L2 reading comprehension scores. 

Methodology 
Subjects 

The subjects included 131 cadets (88 Beginners and 43 Non- 
Beginners) enrolled in French courses at the United States Air 
Force Academy (USAFA) in Colorado Springs, CO. The Beginners were 
enrolled in Basic French. The Non-Beginner group consisted of 21 
intermediate- level and 21 advanced- level subjects. 

Instrumentation 

Independent Variables 

LI Reading Comprehension . 

Two separate testing instruiTients were used to assess LI 
reading comprehension: (a) the comprehension portion of the 
Nelson-Denny Reading Test (NDRT) , Form E (Brown, Bennett, & Hanna, 
1981), and (b) the mean percentage score on three immediate timed 
written recall protocols. The comprehension portion of the NDRT 
consists of eight reading passages, followed by a total of 36 
multiple choice questions. 

The three recall passages, selected from the Encyclopaedia 
Britannica (1984), were each between 200 and 250 words in length, 
and historical in nature. Two trained native English-speaking 
raters scored the protocols using the Johnson system (Johnson, 
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1970) . Interrater reliability, measured using the Pearson 
product -moment correlation procedure, was .94. 
LI Writing Ability . 

This variable was measured using the combined holistic scores 
on the same three recall protocols that were employed to assess LI 
reading comprehension. In order to arrive at the scores for 
writing ability, four trained, experienced raters used the Test of 
Written English Scoring Guide (Educational Testing Service, 1990) . 
Overall interrater reliability, as measured by the Spearman -Brown 
Prophecy Formula (Henning, 1987), was .95. 
L2 Vocabularv Knowledge . 

A supply-definition test was used to measure L2 vocabulary 
knowledge. Two experienced French language instructors from the 
Air Force Academy selected the 50 words appearing in the French 
reading passages (See Dependent Variable: L2 Reading 
Comprehension) that the beginning- level subjects would find the 
most difficult to define. (The beginning level was used as the 
base line.) For each French word, the subjects were required to 
write, in English, either the definition or the English 
equivalent. Two French language experts scored each answer on a 
two-point scale consisting of the following: (a) zero points for 
an incorrect response, (b) one point for a partially correct 
response, and (c) two points for a correct response. Interrater 
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reliability, as measured by the Pearson product -moment correlation 
coefficient, was .99. 
L2 Grammat i na l Skill . 

Scorea for French grammatical skill were obtained from the 
granunar portion of uSAFA's French placement test. This section of 
the test features multiple choice and cloze items. 
Dependent Variable: T,2 Readd ng C omprehend j on 

This variable was measured using the mean percentage scores 
on three immediate free written recall protocols. The three 
French passages, each between 200 and 250 words in length and 
historical in nature, were selected from the EncycHoEaedia 
Universalis (1968) , the French counterpart to the .EncycloEaedia 
Britannica . Texts of the same genre as those used in the Ll ■ 
recall procedure were selected in order to maintain consistency in 
discourse type. Two French language experts scored the protocols 
using the Johnson system (Johnson, 1970) . Interrater reliability, 
measured using the Pearson product -moment correlation coefficient, 
was . 94 . 

Pro cedures and Data C ollection 
Procedures and data collection will be explained in the order 
in which the tests were administered. Data collection took place 
during the fall semester of 1991. 
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NDRT. 

USAFA personnel administer the NDRT to all cadets entering 
the Academy as part of the admissions procedure. Scores were 
obtained through the Department of English, USAFA. 
L2 Grammatical Skill . 

Two months before the recall protocol data were collected, 
the French placement test was administered to the beginning- level 
subjects by members of the USAFA French Division as part of the 
Academy's admissions procedure. The researcher administered the 
grammar portion to the intermediate- and advanced- level subjects 
during class time three weeks before the recall protocol data were 
collected . 

L2 Vocabulary Knowledge . 

Two weeks before the recall prc*-.ocol data were collected, the 
USAFA French instructors assigned the vocabulary test to be 
completed outside of class. The subjects were told that they 
would be under the provisions of the Cadet Honor Code not to use 
dictionaries or to seek help in defining the words. They were 
allowed a maximum of 50 minutes (one minute per word) to complete 
the test. The course instructors collected the tests during the 
following class period. 
Recall Protocols . 

The recall protocols were collected over a four-day period. 
Because the USAFA-wide class schedule is composed of two-day 



cycles, some of the French classes meet the first day of the cycle 
while the others meet the second. The subjects completed the 
recall protocols based on LI passages during the first two-day 
cycle and those based on L2 texts during the second cycle. 

Before the experiment, the researcher used the class rosters 
to determine the number of subjects at each language level, and 
then assembled the appropriate number of research packages. The 
texts were arranged in differing orders in order to limit the 
effects of fatigue. Subjects were selected through the use of a 
table of random numbers (Snedecor & Cochran, 1967) to receive the 
various arrangements. 

When the LI recall protocols were administered, the subjects 
were told that the research packt^ts contained three exercises ' and 
that they would have five minutes to complete each one. The time 
limit was imposed in order to decrease the possibility of the 
subjects memorizing the passages. Additionally, the subjects were 
advised that upon opening the packet to the first passage, they 
would have a total of five minutes to read it and, without 
referring back to it, to write, in English, everything they could 
remember about it. After the first exercise was completed, the 
same procedure was used for the second and third. 

The same procedures were followed when the recall protocols 
based on L2 reading passages were administered, with two 
exceptions. First, the subjects had the entire 50-minute class 



period to read and recall (in English) the French texts. Second, 
because the subjects were working at their own pace, thsy were 
reminded to complete the readings in the order given and that once 
they completed one recall exercise and began another, they could 
not go back. After each subject had completed the three 
protocols, his or her packet was collected. 

Data Analysis 

Six programs of hierarchical multiple regression were 
employed. Individual programs were run for All Subjects, for 
Beginners, and for Non-Beginners. Additionally, because LI 
reading comprehension was measured using two different testing 
instruments, separate programs were performed for each group,, 
first using the NDRT comprehension scores, and then using the* 
recall protocol scores. 

In all of the hierarchical regression programs used, the 
scores for the independent variables were entered into the 
statistical model in the following order: (a) LI reading, (b) LI 
writing, (c) L2 vocabulary, and (d) L2 grammar. The LI variables 
were introduced first because the subjects, all native English 
speakers, were considerably more experienced with the English 
language than they were with the L2 . LI reading data were 
inserted before the LI writing scores because, accordi.ig to 
Shanahan and Lomax (1986) , "more reading information is used in 
writing than vice versa" (p. 122). L2 vocabulary scores were 
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entered before those for L2 grammar because L2 vocabulary skill 
consistently contributed more to L2 reading comprehension scores 
than did L2 grammatical knowledge in the stepwise multiple 
regression analyses performed by Henning (1975) and Koda (1990) . 



Thrae principal findings have emerged from an analysis of the 
means, standard deviations, and minimum and maximum scores. (See 
Tables 1 and 2.) First, the means are higher for Non-Beginners 
than for Beginners on all variables. This dissimilarity for the 
L2 variables is consistent with the course curriculum in which the 
subjects are increasingly exposed to elements of L2 literacy. 
Unexpected however, were the differences for the LI variables'. 
These data suggest that language students opting to continue their 
L2 studies comprise a select group of highly literate individuals. 

Second, for both Beginners and Non-Beginners, the mean NDRT 
comprehension scores tend to be concentrated closer to the maximum 
possible score than are the mean recall scores. Because of the 
cluster of high NDRT comprehension scores, the distributions for 
All Subjects, Beginners, and Non-Beginners were profoundly 
negatively skewed. (See Figure 1,) In contrast, the 
distributions of the recall scores for the three subject groups 
were nearly normal. (See Figure 2,) Unlike the near-normal 
distributions of recall scores, the negatively skewed 



Results and Discussion 



Means and Standard Deviations 
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Table 1; Means an d Standard Deviations for Literacy 
V ariables for Beginners 

N Mean SD Min . Max. 



1 . LI Reading 

a. NDRT 87 25.87 5.97 8 34 

b. Protocols 87 36.44 8.71 18 63 

2. LI Writing 87 10.31 2.24 5 15 

3. L2 Vocabulary 88 9.76 6.34 0 32 

4. L2 Grammar 86 22.07 6.13 1 39 

5. L2 Reading 88 19.73 6.53 7 39 



Table 2: Means and Standard Deviations for Literacy 
Variables for Non-Beoinners 

N Mean SD Min. Max. 



1 . LI Reading 

a. NDRT 39 29.64 3.44 21 35 

b. . Protocols 42 44.19 9.75 25 62 

2. LI Writing 41 12.00 3.22 5 18 

3. L2 Vocabulary 42 39.50 14.91 IQ 81 

54 



4. L2 Grammar 43 42.56 5.65 29 

5. L2 Reading 43 38.53 10.22 21 



62 
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Figure 1. Frequencies of NDRT Comprehension Scores for 
All Subjects 
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Figure 2 . Frequencies of LI Recall Protocol Scores for 
All Subjects 
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distributions of NDRT comprehension scores exhibit minimal 
variability, pointing to limitations in the test's discriminatory 
power. In other words, integrative tests such as the recall 
protocol clearly provide a more sensitive measure of reading 
comprehension . 

Third, the standard deviation of the NDRT comprehension 
scores is smaller for Non-Beginners than it is for Beginners, 
while the opposite is true for the standard deviation of the 
recall scores. In other words, the Non-Beginners' NDRT 
comprehension scores are less variable than those of the 
Beginners, while the Non -Beginners ' recall scores are more 
variable. Thus, as reading skill increases, the greater 
sensitivity and enhanced discriminatory power of integrative tests 
come to the fore . 

Hierarchical Regression Analyses 

All Subjects 

Regardless of the instrument used to assess LI reading 
comprehension, the independent variables contributed in the 
following order of importance: (a) L2 vocabulary, (b) LI reading, 
(c) LI writing, and (d) L2 grammar. (See Tables 3 and 4.) The 
finding that the L2 vocabulary scores contributed more to the L2 
reading scores than did the scores for any other independent 
variable, particularly grammatical skill, is consistent with the 
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Table 3 : Summary of Hierarchical Regression Ai^alysis for 
All Subjects using NDRT Comprehension Scores 

Steps/Variables R Increment Cum. R *- F p 

1. LI Reading .1412 .1412 20.38 .0001 

(126) 

2. LI Writing .0703 .2115 10.70 .0500 

(123) 

3. L2 Vocabulary .3568 .5683 58.27 .0001 

(123) 

4. L2 Grammar .0304 .5987 8.78 .0500 

(121) 

Note. Sample size is indicated in parentheses. 

Table 4 : Summary ot Hierarchical Regression Analysis for 
All Subjects using LI Protocol Scores 

Steps/Variables R 2 increment Cum. r2 f p 

1. LI Reading .2749 .2749 47.77 .0001 

(128) 

2. LI Writing .0384 .3133 6.93 .0200 

(127) 

3. L2 Vocabulary .3091 .6224 99.84 .0001 

(126) 

4. L2 Grammar .0319 .6543 10.97 .0050 

(124) 

Note. Sample size is indicated in parentheses. 
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results of the stepwise regression studies performed by Henning 

(1976) and Koda (1990) . 

Although L2 vocabulary knowledge was the primary contributor, 
the LI variables offered a substantial cumulative contribution in 
both models (21.15% using NDRT comprehension scores and 31.33% 
using LI recall scores). This finding not only bolsters Cummins' s 

(1981) Interdependence Hypothesis, but corroborates the results of 
previous research pointing to the transfer phenomenon between LI 
and L2 reading skills (Carson et al . , 1990; Groebel, 1980; Reyes, 
1987; Wagner et al . , 1989) and to those indicating that reading 
ability and writing skill transfer to one another (Carson et al . , 
1990; Juel et al . , 1986; Langer, 1984; Schewe & Froese, 1986). 

The results for All Subjects also indicate that some of the 
elements in Bernhardt' s (1986) Constructivist reading Model 
contribute more to the reading process than do others. According 
to the data, L2 vocabulary knowledge contributes more to L2 
reading comprehension than does L2 grammatical skill. 
Additionally, prior knowledge in the form of LI literacy abilities 
contributes a great deal to L2 reading. Evidently, language 
students depend upon their LI skills in order to make sense of the 
L2 . 

The most intriguing difference between the results of the two 
regression models is the dissimilarity in the contributions of the 
LI reading scores when assessed using the NDRT comprehension tent 
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versus recall protocols. When recall scores were used, LI reading 
contributed nearly twice the variance (27,49%) as it did when NDRT 
comprehension scores were used (14.12%). This situation came 
about because the more normally distributed recall scores (See 
Figure 2) had more variance to contribute. The fact that the 
overall picture of reading comprehension changes depending on the 
testing instrument used to measure it underscores the need for 
multiple measures. 

Additionally, a greater percentage of the overall variance 
was accounted for in the regression model using recall scores 

(65.43%) versus the one featuring NDRT comprehension scores 

(59.87%), This difference again relates to the greater 
variability in the recall scores. The more sensitive the 
assessment measure is, the more accurate the information will be 
about the components of the L2 reading comprehension process. 
Beginners versus Non-Becrinners using NDRT Comprehension Scores 

For Beginners, the order of importance for the independent 
variables was: (a) L2 vocabulary (8.50%), (b) LI reading (7.06%), 

(c) LI writing (3.47%), and (d) L2 grammar (1.24%). (See Table 
5.) For Non-Beginners, the order was: (a) L2 vocabulary (8.67%), 

(b) LI writing (5.85%), 'c) LI reading (4.42%), and (d) L2 grammar 

(1,49%) , (See Table 6.) 

Consistent with the results for All Subjects, L2 vocabulary 
knowledge contributed more to the L2 reading scores than did any 
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Table 5 : Summary of Hierarchical Regression Analysis for 
Beginners Using NDRT Comprehension Scores 

Steps/Variablt. . Increment Cum. R^ F p 

1. LI Reading .070G ,0706 6.46 .0129 

(87) 

2. LI Writing .0347 .1053 3.22 .0600 

(86) 

3. L2 Vocabulary .0850 .1903 8.61 .0043 

(86) 

4. L2 Grammar .0124 .2027 1.23 .2000 

(84) 

Note. Sample size is indicated in parentheses. 

Table 6 : Summary of Hierarchical Regression Analysis for 
Non-Beginners Using NDRT Comprehension Scores 

2 '~9 

Steps/Variables R Increment Cum. k F p 

1. LI Reading .0442 .0442 1.71 .1992 

(39) 

2. LI Writing .0585 .1027 2.21 .1500 

(37) 

3. L2 Vocabulary .0867 .1894 3.53 .0691 

(37) 

4. L2 Grammar .0149 .2043 .60 .4444 

(37) 

Note. Sample size is indicated in parentheses. 
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of the other independent variables, especiaUy grammatical skill. 
This finding coincides with the results of the regression studies 
performed by Henning (1975) and Koda (1990) . Additionally, the 
cumulative contribution of the LI variables was greater than that 
of the L2 variables for both groups, faiLher reinforcing Cummins' s 
(1981) Interdependence Hypothesis. 

Furthermore, these data aid in expanding Bernhardt 's (1986) 
Constructivist Reading Model by exploring the contributions of 
several of its components by course level. Vocabulary knowledge 
was shown to be a crucial element in the reading comprehension 
process, slightly more so for Non-Beginners (8.67%) than for 
Beginners (8.50%). Additionally, the cumulative contribution of 
the L2 variables was greater for Non -Beginners than for Beginners, 
while the opposite was true for that of the LI variables. 
Evidently, language students rely on prior knowledge in the form 
of LI skills until their L2 abilities develop and thus become 
increasingly useful in the reading comprehension process. 

Both models resulted in nearly equal portions of the total 
variance unaccounted for: 79.73% for Beginners and 79.57% for 
Non-Beginners. Clearly, L2 readers draw upon components of the 
reading process that were not included in the present set of 
independent variables. 
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Beginners vers us Non-Beginners using Recall Protocol Scores 

For Beginners, the independent variables contributed as 
follows: (a) LI reading (10.58%), (b) L2 vocabulary (8.38%), LI 
writing (3.00%), and (d) L2 grammar (1.85%). (See Table 7.) The 
order of importance for Non-Beginners was; (a) LI reading 
(23.96%), (b) L2 vocabulary (5.40%), (c) L2 grammar (2.69%), and 
(d) LI writing (0.43%). (See Table 8.) 

Unlike the regression models for Beginners and Non-Beginners 
using NDRT comprehension scores, LI reading skill (rather than L2 
vocabulary knowledge) was the major contributor for both groups. 
This difference resulted from the increased variability in the 
recall scores, particularly for Non-Beginners. The fact that LI 
reading ability contributed more than twice the variance for Non- 
Beginners (23.96%) as it did for Beginners (10.58%) indicates that 
the sensitivity of the recall protocol increases along with 
language level . 

Consistent with the results of the programs for Beginners and 
Non-Beginners using NDRT comprehension scores, the regression 
models using recall scores show a higher cumulative contribution 
for the LI variables than for the L2 variables. Once again, this 
finding supports Cummins 's (1981) Interdependence Hypothesis. 
Furthermore, and coinciding with the results of the other 
regression models in this study, L2 vocabulary contributed more to 
the L2 reading scores than did L2 grammatical skill. 
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Table 7 : Summary of Hierarchical Regression Analysis for 
Beginners Using LI Protocol Score s 



Steps/Variables R ^ increment Cum. r2 f 

1. LI Reading .1058 .1058 10.06 

(87) 

2. LI Writing .0300 .1358 2.91 

(87) 

3. L2 Vocabulary .0838 .2196 8.91 

(87) 

4. L2 Grammar .0185 .2381 1.95 

(85) 

Note. Sample size is indicated in parentheses. 



P 

.0021 
. 0916 
. 0037 
. 2000 



Table 8 : Summar^/ of Hierarchical Regression Analysis for 
Non-Beginners Using LI Protocol Scores 



Steps/Variables R Increment Cum. R^ F 



1. LI Reading 



.2396 12.60 



,2439 



.2396 
(42) 

2. LI Writing .0043 

(41) 

3. L2 Vocabulary .0540 

(40) 

4. L2 Grammar .0269 

(40) 

Note. Sample size is indicated in parentheses. 



P 

. 0010 



21 .5000 



.2979 2.74 .1500 



.3248 1.38 .2485 
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The regression models using recall scores show 76.19% of the 
overall variance unaccounted for in the data for Beginners and 
67.52%- unaccounted for in the Non-Beginner group. Again, it is 
evident that L2 readers draw on components of the L2 reading 
process that were not included in this study. 

Implications 
Assessment 

In the area of assessment, this study has opened a window on 
the dilemma regarding L2 reading test validity. The results of 
the models for All Subjects show the effect of LI writing ability 
to be small but significant. Thus, although the contribution of 
LI writing ability to the scores on L2 reading tests appears to be 
slight, it should be taken into consideration when interpreting 
test scores. 

These data will lead to enhanced test development and to 
improvements in diagnostic and placement testing. In terms of 
test development, the knowledge that LI writing ability is a 
confounding factor, albeit a minor one, points to the need for 
testing methods that will decrease or eliminate its effects. For 
diagnostic testing, perhaps Ll literacy abilities need to be 
tested in order to gain insight into learner difficulties in L2 
study. Regarding placement testing, Ll literacy abilities, 
especially reading skill, have been identified as important 
predictors of L2 literacy ability. Ll literacy skills, therefore. 



should be considered when placing students into language course 
levels. Additionally, because different results emanate from 
different testing instruments, the use of mult 'pie measures is 
necessary in order to arrive at an accurate picture of what has 
been comprehended. 



In the pedagogical domain, this study attests to the critical 
nature of vocabulary knowledge to L2 reading comprehension. 
Instructors and curriculum developers should continue to stress 
vocabulary skill as a crucial element of L2 study. 

Additionally ; the data underline the importance of 
capitalizing on students' already developed Ll knowledge. 
Instructors should draw attention to parallels and contrasts ' 
between the students' Ll and the L2 , and encourage learners to 
transfer their skills between languages. 

In programs in which the learners are already literate in 
their native languages, the importance of Ll literacy abilities to 
the development of L2 literacy should be increasingly reflected in 
L2 curriculum development. In such programs, beginning- level 
courses should aid students in going from the known to the unknown 
by emphasizing the use of Ll knowledge in L2 study. Follow- on 
courses should guide learners to increased use of their developing 
L2 knowledge. Furthermore, students experiencing difficulties in 
the L2 may benefit from further Ll instruction. 
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Research 

Research is needed that will validate and expand upon the 
results of the present study. Replications should include 
subjects of diverse ages and language backgrounds. Additionally, 
because gross measures of reading and writing were used in the 
present study, replications are needed in which finer assessment 
measures will be employed. Alternate data analysis techniques 
could also be used. The independent variables in the present 
study, for example, should be entered into programs of 
hierarchical multiple regression in varying orders. Qualitative 
methods could be employed. Furthermore, because the variance in 
the L2 reading scores i? still not completely accounted for, 
additional independent variables need to be considered. 

In addition, integrative testing methods, such as the recall 
protocol procedure, need to gain wider acceptance. Not only is 
the recall protocol more sensitive than discrete-point tests, but 
its sensitivity becomes more pronounced as reading proficiency 
increases . 

Most importantly, the results of this study accentuate the 
need for multiple measures. Multiple testing instruments will 
lead to a clearer view of the constructs being measured, and hence 
to more astute research insights and to more perceptive theory 
building . 
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